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dot draws directed graphs as hierarchies. Like its predecessor, dag , it is a Unix filter, makes 
good drawings, and runs quickly. Its important new features are node ports for drawing data 
structures with pointers; improved placement of nodes, edge splines and labels; cluster layouts; and 
an underlying file language for graph tools. Here is a reduced module dependency graph of the 
SML-NJ compiler. The layout took 3.5 seconds of user time on an HP-9000/730 computer. 
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1 Basic Graph Drawing 


dot draws directed graphs. It reads attributed graph text files and writes drawings, either as graph 
files or in a graphics language such as PostScript. 

dot takes four main steps in drawing a graph. Knowing about these helps you to understand 
what kind of layouts dot makes, and how you can modify its layouts. The first step assigns discrete 
ranks to nodes. In a top to bottom drawing, ranks determine Y coordinates. Edges that span more 
than one rank are broken into chains of “virtual” nodes and unit-length edges. The second step 
orders nodes within ranks to avoid crossings. The third step sets X coordinates of nodes to keep 
edges short. The last step routes edge splines. This is the same general approach as dag , which in 
turn builds on the work of Warfield [War77], Carpano [Car80] and Sugiyama [STT81]. We refer the 
reader to [GKNV93] for explanation of dot’s algorithms. 

dot’s graph language has three kinds of items: graphs, nodes, and edges. The main (outermost) 
graph can be graph (undirected) or a digraph (directed). Because dot makes layouts of directed 
graphs, all the examples in this user’s guide use digraph. We have written a separate layout utility, 
neato, to draw undirected graphs [Nor92]. Within a main graph, a subgraph defines a subset of 
nodes and edges. 

Figure 1 is an example graph in dot’s language. Line 1 gives the graph name and type. The 
following lines create nodes, edges, or subgraphs, and set attributes. Names may be C identifiers, 
numbers, or quoted C strings. Quotes protect punctuation or white space. 

A node is created the first time its name appears in the file. An edge is created when nodes are 
joined by the edge operator ->. In the example, line 2 makes edges from main to parse and from 
parse to execute. Running dot on this file (say graphl .dot) yields the drawing of figure 2 1 

$ dot -Tps graphl. dot -o graphl.ps 

The command line option -Tps selects PostScript (EPSF) output, graphl.ps may be printed, 
displayed by a PostScript viewer, or embedded in another document. 

It is often useful to adjust the representation or placement of nodes and edges in the layout. 
This is done by setting attributes of nodes, edges, or subgraphs in the input file. Attributes are 
name-value pairs of character strings. Figures 3 and 4 illustrate some layout attributes. In the 
listing of figure 3, line 2 sets the graph’s size to 4,4 (all dimensions are in inches). This attribute 
controls the bounding box- the drawing is scaled as necessary to fit. 

Node or edge attributes are set off in square brackets. In line 3, the node main is assigned shape 
box. The edge in line 4 is straightened by increasing its weight (the default is 1). The edge in line 
6 is drawn as a dotted line. Line 8 makes edges from execute to make_string and printf . In line 
10 the default edge color is set to red. This affects any edges created after this point in the file. 
Line 11 makes a bold edge labeled 100 times. In line 12, node make_string is given a multi-line 
label. Line 13 changes the default node to be a box filled with a shade of blue. The node compare 
inherits these values. 


1 Unlike dag, the .GS command is not needed. 
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i: digraph G { 

2: main -> parse -> execute; 

3: main -> init; 

4: main -> cleanup; 

5: execute -> make_string; 

6: execute -> printf 

7: init -> make_string; 

8: main -> printf; 

9: execute -> compare; 

10 : > 

Figure 1: Small graph 



Figure 2: Drawing of small graph 
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1: digraph G { 

2: size ="4,4"; 

3: main [shape=box] ; /* this is a comment */ 

4: main -> parse [weight=8] ; 

5: parse -> execute; 

6 : main -> init [style=dotted] ; 

7: main -> cleanup; 

8: execute -> { make_string; printf} 

9: init -> make_string; 

10 : edge [color=red] ; 

11: main -> printf [style=bold,label="100 times"]; 

12: make_string [label="make a\nstring"] ; 

13: node [shape=box,style=filled,color=".7 .3 1.0"]; 

14: execute -> compare; 

15: > 

Figure 3: Fancy graph 



Figure 4: Drawing of fancy graph 
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2 Drawing Attributes 

The complete list of attributes that affect graph drawing is summarized in table 1. 

2.1 Shapes and Labels 

By default, nodes are drawn with shape=ellipse, width=.75, height=.5, and labeled by the node 
name. Other common shapes (box, circle, etc.) are listed in table 1. The node shape plaintext 
is of particularly interest in that it draws a node without any outline, an important convention in 
some kinds of diagrams. When drawn, a node’s actual size is the greater of the requested size and 
the area needed for its text label. By default, edges are unlabeled. Node and edge labels can be 
set explicitly as shown n figure 4. Though it is convenient that nodes are labeled with their names 
by default, sometimes it is essential to set labels explicitly. For example, in drawing a file directory 
tree, one might have several directories named src, but each one must have a unique node identifier. 
The inode number or full path name are suitable unique identifiers. Then the label of each node can 
be set to the file name within its directory. 

In multi-line labels, \n, \1, \r terminate lines that are centered, or left or right justified. 2 Graphs 
and cluster subgraphs may also have labels. 

The default font is 14-point Times-Roman, in black. Other font families, sizes, and colors may 
be selected. Font names should be compatible with the target interpreter (usually PostScript). It 
is best to use only the standard font families Times, Helvetica, Courier, or Symbol as these are 
guaranteed to work with any target graphics language. For example, Times-Italic, Times-Bold, 
or Courier are portable, but AvanteGarde-DemiOblique is not. 

Nodes with shape record or polygon have special properties. Section 3 reviews some details of 
using records. Polygons are useful for many shapes that are not predefined. They are parameterized 
by number of sides, peripheries, orientation, skew, and distortion, as illustrated in figures 5 and 6. 
peripheries is the number of borders. For example, a doublecircle has 2 peripheries, orientation 
is clockwise rotation from the X axis in degrees, skew is a floating point number (usually between 
— 1.0 and 1.0) that distorts the shape by slanting it from top-to-bottom, for example, turning a box 
into a parallelogram, distortion shrinks from top-to-bottom, for example, turning a box into a 
trapezoid. 

Though there is a way to implement custom node shapes, the details are beyond the scope of 
this user’s guide. Please contact the authors for further information. 

2.2 Graphics Styles 

Nodes and edges have color and style attributes. A color value can be a hue-saturation-brightness 
triple (three floating point numbers between 0 and 1), or one of the colors names listed in Appendix 
B (borrowed from some version of the X window system). The numerical form is convenient for 
scripts or tools that automatically generate colors. Color name lookup case and puncutation and 
insensitive, so "warmgrey" and Warm_Grey are equivalent. 

We can offer a few hints regarding use of color in graph drawings. First, avoid using too many 
bright colors. A “rainbow effect” is confusing. It’s better to choose a narrower range of colors, or to 
vary saturation along with hue. Second, when nodes are filled with dark or very saturated colors, 
labels seem to be more readable with fontcolor=white and f ontname=Helvetica. (We also have 
PostScript functions for dot that create outline fonts from plain fonts.) Third, you can define your 


2 The escape \N is an internal symbol for node names. 
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Name 

Default 

Values 

Node Attributes 

color 

black 

node shape color 

fontcolor 

black 

type face color 

f ontname 

Times -Roman 

PostScript font family 

font size 

14 

point size of label 

height, width 

.5, .75 

height and width in inches 

label 

node name 

any string 

layer 

overlay range 

all, id or id:id 

shape 

ellipse 

ellipse, box, circle, doublecircle, diamond, 



plaintext, record, polygon, epsf 

shapef ile 


external EPSF file if epsf shape 

style 


graphics options, e.g. bold, dotted, filled 

Edge Attributes 

color 

black 

edge stroke color 

decorate 


if set, draws a line connecting labels with their edges 

dir 

forward 

forward, back, both, or none 

fontcolor 

black 

type face color 

f ontname 

Times-Roman 

PostScript font family 

f ontsize 

14 

point size of label 

id 


optional value to distinguish multiple edges 

label 


label, if not empty 

layer 

overlay range 

all, id or id:id 

minlen 

1 

minimum rank distance between head and tail 

style 


graphics options, e.g. bold, dotted, filled 

weight 

1 

integer reflecting importance of edge. 

Graph Attributes 

center 


when true, centers drawing on page 

clusterrank 

local 

may be global or none 

color 

black 

background or cluster outline color 

concentrate 


enables edge concentrators when TRUE 

fontcolor 

black 

type face color 

f ontname 

Times-Roman 

PostScript font family 

f ontsize 

14 

point size of label 

label 


any string 

layerseq 


id:id:id... 

margin 

.5, .5 

margin included in page 

mclimit 

1.0 

if set to /, adjusts mincross iterations by (f) 

nodesep 

.25 

separation between nodes, in inches. 

nslimit 


if set to /, bounds network simplex iterations by (f) (number 



of nodes) 

ordering 


out (for ordered edges) 

orientation 

portrait 

may be set to landscape 

page 


unit of pagination, e.g. 8.5,11 

rank 


same, min, or max 

rankdir 

TB 

LR (left to right) or TB (top to bottom) 

ranksep 

.75 

separation between ranks, in inches. 

ratio 


approximate aspect ratio desired, or fill 

size 


drawing bounding box, in inches 


Table 1: Drawing attributes 
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Figure 5: Example of polygonal shapes for nodes 


1: digraph G { 

2: a -> b -> c; 

3: b -> d; 

4: a [shape=polygon,sides=5,peripheries=3,color=blue_light,style=f illed] ; 

5: c [shape=polygon,sides=4,skew=.4,label="hello world"] 

6: d [shape=invtriangle] ; 

7: e [shape=polygon, sides=4,distortion= . 7] ; 

8 : > 


Figure 6: Listing of graph with polygonal shapes 



Drawing graphs with dot 


8 


own color space by redefining nodecolor, edgecolor, or graphcolor in a library file. For example, 
to use RGB colors, place the following line in a file lib.ps. 

/nodecolor {setrgbcolor} bind def 

Use the -1 command line option to load this file. 

dot -Tps -1 lib.ps file. dot -o file.ps 

style controls miscellaneous graphics features of nodes, edges, graphs or subgraphs. The style is 
a list of primitives with optional argument lists. The predefined primitives are filled solid dashed 
dotted bold and invis. filled when applied to nodes or clusters shades inside the boundary of 
the object using its color. If the color is not set, light grey is used as the default. 

User-defined style primitives can be implemented as custom PostScript procedures. Such prim- 
itives are executed inside the gsave context of a graph, node, or edge, before any of its marks are 
drawn. The arg lists are translated to PostScript notation. For example, a node with style="setlinewidth(8) " 
is drawn with a thick outline. Here, setlinewidth is a PostScript built-in, but user-defined 
PostScript procedures are called the same way. The definition of these procedures can be given 
in a library file loaded using -1 as shown above. 

Edges have a dir attribute to set arrowheads, dir may be forward (the default), back, both, 
or none. This refers only to where arrowheads are drawn, and does not change the underlying 
graph. For example, setting dir=back does not exchange the endpoints of a directed edge (unlike 
the dar/program) . 

2.3 Drawing Size and Spacing 

Often a drawing made with the default node sizes and separations is too big for the target printer 
or for the space allowed for a figure in a document. There are several ways to try to deal with this 
problem. First, we will review how dot computes the final layout size. 

A layout is initially made internally at its “natural” size, using default settings (unless ratio=compress 
was set, as described below). By default, nodes are at least .75 (inches) wide by .5 tall; fonts are 
14 points high; nodes are separated by at least .25 and ranks by .5. There is no bound on the size 
or aspect ratio of the drawing, so if the graph is large, the layout is also large. If you don’t specify 
size or ratio, then the natural size layout is printed. 

The easiest way to control the output size of the drawing is to set size=x, y in the graph file (or 
on the command line using -G). This determines the bounding box of the final layout. For example, 
size="7.5,10" fits on an 8.5x11 page (assuming the default page orientation) no matter how big 
the initial layout, ratio also affects layout size. There are a number of cases, depending on the 
settings of size and ratio. 

Case 1. ratio was not set. If the drawing already fits within the given size then nothing 
happens. Otherwise, the drawing is reduced uniformly enough to make the critical dimension fit. 

If ratio was set, there are four subcases. 

Case 2a. If ratio=x where x is a floating point number, then the drawing is stretched 
(adding whitespace) to achieve the requested ratio expressed as drawing width/height. For ex- 
ample, ratio=2.0 makes the drawing twice as wide as it is high. Then the layout is scaled using 
size as in Case 1. 
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Case 2b. If ratio=fill and siz e=x,y was set, then the drawing is stretched (adding whites- 
pace) to achieve the ratio x/y. The effect is that all of the bounding box given by size is filled. 
Then scaling is performed as in Case 1. 

Case 2c. If ratio=compress and size=x, y was set, then the initial layout is compressed to 
attempt to fit it fit it the given bounding box. This trades off layout quality, balance, and symmetry, 
to pack the layout more tightly. Then scaling is performed as in Case 1. 

Case 2d. If ratio=auto then size is ignored and dot computes an “ideal” size using the 
following heuristic: it first attempts to fit the drawing on one page by reducing to not less than 50% 
of its original size. Otherwise, the drawing is printed on multiple pages, using the full area of each 
page and not reducing under 50%. 

At this point, if page is not set, then the final layout is printed as one page. 

If page=x, y is set, then the layout is printed as a sequence of pages that can tiled or assembled 
into a mosaic. Common settings are page="8.5,ll" or page="ll , 17". These values refers to the 
size of the physical device, and are independent of landscape mode. For tiled layouts, you may 
find it helpful to set smaller margins (the default is margin=.5). Although you can set margin=0, 
unfortunately, many bitmap printers have an Internal hardware margin that cannot be overridden. 

If rotate=90 is set, then the layout is printed in landscape mode. The X axis of the layout 
would be along the Y axis of each page. This does not affect the dot’s interpretation of size, ratio, 
or ’’page. 

A common problem is that a large graph drawn at a small size yields unreadable node labels. 
To make larger labels, something has to give. There is a limit to the amount of readable text that 
can fit on one page. Often you can draw a smaller graph by extracting an interesting piece of the 
original graph before running dot. We have some tools that help with this. 

• sccmap - decompose into strongly connected components 

• tred - compute transitive reduction (remove edges implied by transitivity) 

• gpr - ”raph processor to select nodes or edges, and contract or remove the rest of the graph 

• unflatten - improve aspect ratio of trees by staggering the lengths of leaf edges 

With this in mind, here are some thing to try on a given graph: 

1. Increase the node f ontsize. 

2. Use smaller ranksep and nodesep. 

3. Use ratio=auto. 

4. Use ratio=compress and give a reasonable size. 

5. A sans serif font (such as Helvetica) may be more readable than Times when reduced. 

2.4 Node and Edge Placement 

Sometimes it is natural to make edges point from left to right instead of from top to bottom. 
If rankdir=LR in the top-level graph, the drawing is rotated in this way. TB (top to bottom) 
is the default. (BT seems potentially useful for drawing upward-directed graphs, but hasn’t been 
impelemented. In some graphs you could achieve the same effect by reversing the endpoints of edges 
and setting their dir=back.) 
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In graphs with time-lines, or in drawings that emphasize source and sink nodes, you may need to 
constrain rank assignments. The set of a subgraph may be set to minrank, maxrank, or same rank. 
This constrains the nodes in the subgraph. Figures 7 and 8 illustrate using subgraphs for controlling 
rank assignment. 

In some graphs, the left-to-right ordering of nodes is important. If a subgraph has ordering=out 
then out-edges within the subgraph having the same tail node fan out from left to right in their 
order of creation. 

Also, when nodes are constrained to the same rank, edges with non-zero weight between them 
are aimed across the rank in the same direction (left-to-right, or top-to-bottom in a rotated drawing) 
as far as possible. This fact may be exploited to adjust node ordering by placing invisible edges 
(style="invis") where needed. 

Fine tuning should be approached cautiously, dot works best when it can makes a layout without 
much “help” or interference in its placement of individual nodes and edges. Layouts can be adjusted 
somewhat by increasing the weight \ of certain edges, or by creating invisible edges or nodes using 
style=invis, and sometimes even by rearranging the order of nodes and edges in the file. But 
this can backfire because the layouts are not necessarily stable with respect to changes in the input 
graph. One last adjustment can invalidate all previous changes and make a very bad drawing. A 
future project we have in mind is to combine the mathematical layout techniques of dot with an 
interactive front end that allows user-defined hints and constraints. 


3 Node Ports 

A node port is a point where edges may attach to a node. (When an edge is not attached to a 
port, it is aimed at the node’s center.) A node with a port specifier has the syntax name:port. The 
names and bindings of ports can differ from one node to another, depending on shapes and other 
attributes. Presently only the record shape has ports. This shape represents a record as recursive 
lists of labeled boxes. A port refers to the center of one of the boxes. Ports are created by inserting 
the construct <portid> in a box label, as shown in figures 9 and 10. 

Figures 11 and 12 shows how recursive records are drawn. Vertical bars separate fields at the same 
level, while curly braces enclose subfield lists. Port identifiers are enclosed in angle brackets. (Literal 
braces, vertical bars, and angle brackets must be escaped.) Spaces are interpreted as separators 
between tokens (similar to the way most typesetting programs work) so they must be escaped if you 
want fixed or “hard” spaces. Also, note that records sometimes look better if their input height is 
set to a small value so the text labels dominate the actual size, as illustrated in figure 9. Otherwise 
the default node size (.75 by .5) is assumed, as in figure 11. 

The example of figures 13 and 14 uses left-to-right drawing in a layout of a hash table. 

3.1 Clusters 

A cluster is a subgraph placed in its own distinct rectangle of the layout. A subgraph is recognized 
as a cluster when its name has the prefix cluster (unless the graph’s clusterrank=none). Cluster 
labels, fonts, colors, and styles can be set in the usual way. Clusters are drawn by a recursive 
technique that computes a rank assignment and internal ordering of nodes within clusters. Figure 15 
through 17 are cluster layouts and the corresponding graph files. 
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digraph asde91 { 
ranksep=.75; 
size = "7. 5, 7. 5"; 

{ 

node [shape=plaintext , fontsize=16] ; 

/* the time-line graph */ 

past -> 1978 -> 1980 -> 1982 -> 1983 -> 1985 -> 1986 -> 

1987 -> 1988 -> 1989 -> 1990 -> "future"; 

/* ancestor programs */ 

"Bourne sh"; "make"; "SCCS"; "yacc"; "cron"; "Reiser cpp"; 

"Cshell" ; "emacs"; "build"; "vi"; "<curses>"; "RCS"; "C*"; 

> 

{ rank = same; 

"Software IS"; "Configuration Mgt" ; "Architecture fe Libraries"; 
"Process"; 

>; 

node [shape=box] ; 

{ ra nk = same; "past"; "SCCS"; "make"; "Bourne sh"; "yacc"; "cron"; > 

{ rank = same; 1978; "Reiser cpp"; "Cshell"; > 

{ rank = same; 1980; "build"; "emacs"; "vi" ; > 

{ rank = same; 1982; "RCS"; "<curses>"; "IMX"; "SYNED"; > 

{ rank = same; 1983; "ksh"; "IFS"; "TTU" ; > 

{ rank = same; 1985; "nmake"; "Peggy"; > 

{ rank = same; 1986; "C*"; "ncpp"; "ksh-i"; "<curses-i>"; "PG2"; > 

{ rank = same; 1987; "Ansi cpp"; "nmake 2.0"; "3D File System"; "f delta"; 
"DAG"; "CSAS";} 

{ rank = same; 1988; "CIA"; "SBCS"; "ksh-88" ; "PEGASUS/PHL" ; "PAX"; 
"backtalk" ; > 

{ rank = same; 1989; "CIA++"; "APP"; "SHIP"; "DataShare"; "ryacc"; 
"Mosaic"; } 

{ rank = same; 1990; "libft"; "CoShell"; "DIA"; "IFS-i"; "kyacc"; "sfio"; 
"yeast"; "ML-X"; "DOT"; > 

■[ rank = same; "future"; "Adv. Software Technology"; > 

"PEGASUS/PML" -> "ML-X"; 

"SCCS" -> "nmake"; 

"SCCS" -> "3D File System"; 

"SCCS" -> "RCS"; 

"make" -> "nmake"; 

"make" -> "build"; 

"Bourne sh" -> "Cshell"; 

"Bourne sh" -> "ksh"; 

"Reiser cpp" -> "ncpp"; 

"Cshell" -> "ksh"; 


Figure 7: Graph with constrained ranks 
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digraph g { 

node [shape = record, height=. 1] ; 
nodeO [label = "<f0> |<fl> G|<f2> "] 
nodel [label = "<f0> |<fl> E|<f2> "] 
node2 [label = "<f0> |<fl> B|<f2> "] 
node3 [label = "<f0> |<fl> F|<f2> "] 
node4 [label = "<f0> |<fl> R|<f2> "] 
node5 [label = "<f0> |<fl> H|<f2> "] 
node6 [label = "<f0> |<fl> Y|<f2> "] 
node7 [label = "<f0> |<fl> A|<f2> "] 
node8 [label = "<f0> |<fl> C|<f2> "] 
"node0":f2 -> "node4":fl; 

"nodeO" :f0 -> "nodel" :fl; 

"nodel" :f0 -> "node2":fl; 

"nodel" :f 2 -> "node3":fl; 

"node2" :f2 -> "node8":fl; 

"node2" :f0 -> "node7":fl; 

"node4" :f2 -> "node6":fl; 

"node4":f0 -> "node5":fl; 


> 


Figure 9: Binary search tree using records 



Figure 10: Drawing of binary search tree 
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digraph, struct s { 
node [shape=record] ; 

structl [shape=record, label="<f 0> left|<fl> middle 
struct2 [shape=record, label="<f 0> one|<fl> two"]; 
struct3 [shape=record,label="hello\nworld Kb |{c 
structl:fl -> struct2:f0; 
structl :f 2 -> struct3:here; 

} 


<f 2> right"] ; 

<here> d|e>| f}| g | h"] ; 


Figure 11: Records with nested fields 



Figure 12: Drawing of records 


3.2 Concentrators 

Setting concentrate=true on the top level graph enables an edge merging technique to reduce 
clutter in dense layouts. Edges are merged when they run parallel and have a common endpoint. A 
beneficial side-effect in fixed-sized layouts is that removal of these edges often permits larger, more 
readable labels. While dot’s concentrators look somewhat like Newbery’s [New89], they are found 
by searching the edges in the layout, not by detecting complete bipartite graphs in the underlying 
graph. Thus the dot approach runs much faster but doesn’t collapse as many edges as Newbery’s 
algorithm. 


4 Command Line Options 

By default, dot operates in filter mode, writing graphs in the input format with layout attributes 
appended. -Tps sets PostScript output. -Tpcl emits HPGL/2 with PCL-5 wrappers, for HP 
LaserWriters. -Thpgl emits pure HPGL for wide bed pen plotters. -Tmif emits FrameMaker MIF 
files. In this mode, graph layouts can be loaded into FrameMaker and edited manually. FrameMaker 
is limited to 8 basic colors. 

-Gname=value sets a graph attribute default value. Often it is convenient to set size, pagination, 
and related values on the command line rather than in the graph file. Note that file contents override 
command line arguments. -N or -E instead of -G set default node or edge attributes. 

-1 loads graphics library files. 

-o sets the output file. 
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1 : digraph G { 

2: nodesep=.05; 

3: rankdir=LR; 

4: node [shape=record,width=.l,height=.l] ; 

5: 

6: nodeO [label = "<f0> |<fl> |<f2> |<f3> |<f4> I <f 5> |<f6> I " ,height=2. 5] ; 

7 : node [width = 1.5]; 

8: nodel [label = "{<n> nl4 | 719 I <p> >"] ; 

9: node2 [label = "-[<n> al | 805 |<p> >"] ; 

10: node3 [label = "{<n> i9 | 718 |<p> }"] ; 

11: node4 [label = "{<n> e5 | 989 |<p> }"] ; 

12: node5 [label = "{<n> t20 I 959 |<p> }"] ; 

13: node6 [label = "{<n> ol5 I 794 | <p> >"] ; 

14: node7 [label = "{<n> sl9 I 659 |<p> >"] ; 

15: 

16: node0:f0 -> nodel :n; 

17: node0:fl -> node2:n; 

18: nodeO :f 2 -> node3:n; 

19: nodeO :f 5 -> node4:n; 

20: node0:f6 -> node5:n; 

21: node2:p -> node6:n; 

22: node4:p -> node7:n; 

23: > 

Figure 13: Hash table graph file 



Figure 14: Drawing of hash table 
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digraph G { 

subgraph clusterO { 

node [style=f illed, color=white] ; 

style=f illed; 

color=lightgrey ; 

aO -> al -> a2 -> a3; 

label = "process #1"; 

} 

subgraph clusterl { 

node [style=f illed] ; 
bO -> bl -> b2 -> b3; 
label = "process #2"; 
color=blue 

> 

start -> aO; 
start -> bO; 
al -> b3; 
b2 -> a3; 
a3 -> aO; 
a3 -> end; 
b3 -> end; 

start [shape=Mdiamond] ; 
end [shape=Msquare] ; 



Figure 15: Process diagram with clusters 
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Figure 16: Call graph with labeled clusters 
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1: digraph G { 

2: size="8,6"; ratio=fill; node [fonts ize=24] ; 

3: 

4: ciaf an->computef an; f an->increment ; computef an->f an; stringdup->fatal; 

5: main->exit; main->interp_err ; main->ciafan; main->fatal; main->malloc ; 

6: main->strcpy; main->getopt ; main->init_index; main->strlen; fan->fatal; 

7: fan->ref; fan->interp_err ; ciafan->def; fan->free; computef an->stdprintf; 

8: computef an- >get_sym_fields; fan->exit; fan->malloc; increment->strcmp; 

9: computef an- >malloc ; f an->stdsprintf ; fan->strlen; computefan->strcmp; 

10: computef an->realloc ; computefan->strlen; debug->sfprintf ; debug->strcat; 

11: stringdup->malloc; f atal->sfprintf ; stringdup->strcpy ; stringdup->strlen; 

12: fatal->exit; 

13: 

14: subgraph "cluster_error .h" { label="error.h"; interp.err; } 

15: 

16: subgraph "cluster_sf io .h" { label="sf io .h" ; sfprintf; > 

17: 

18: subgraph "cluster_ciafan.c" { label="ciaf an. c" ; ciafan; computefan; 

19 : increment ; } 

20 : 

21: subgraph "cluster_util.c" { label="util.c"; stringdup; fatal; debug; } 

22: 

23: subgraph "cluster.query .h" ■[ label="query .h" ; ref; def; > 

24: 

25: subgraph "cluster_f ield.h" { get_sym.f ields ; } 

26: 

27: subgraph "cluster.stdio.h" ■[ label="stdio .h" ; stdprintf; stdsprintf; } 

28: 

29: subgraph "cluster_<libc . a>" { getopt; > 

30: 

31: subgraph "cluster.stdlib.h" { label="stdlib.h" ; exit; malloc; free; realloc; } 

32: 

33: subgraph "cluster.main. c" { main; } 

34: 

35: subgraph "cluster.index.h" { init_index; > 

36: 

37: subgraph "cluster.string.h" { label="string.h" ; strcpy; strlen; strcmp; strcat; } 
38:} 


Figure 17: Call graph file 
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-v requests verbose output. In processing large layouts, the verbose messages may give some 
estimate of dot’s progress. 

-V prints the version number. 


5 Miscellaneous 

In the top-level graph heading, a graph may be declared a strict digraph. This forbids the creation 
of self-arcs and multi-edges; they are ignored in the input file. 

If a subgraph appears with a body more than once in a graph file, its contents are the union of 
all the nodes and edges. An edge id is an optional string for referencing an edge that was previously 
created. When set, the triple (tail node, head node, key) form a unique edge key. Otherwise, a new 
internal id is generated for each distinct edge between the same pair of nodes. An id may be any 
string. 


6 Conclusions 

dot produces nicer drawings than dag and has some features to help make more readable drawings. 
It is not as fast as dag. Since it still takes only a second or two on reasonable inputs, the new features 
more than compensate. 

In writing graph drawing programs, we have found that it does not take long to get the first 
drawings, but it takes a great deal of work to get truly good drawings. While there is still plenty 
of room for improvement in dot, we have accomplished our principal goals concerning aesthetics, 
performance and new features. Since the basic algorithms of dot work well, we have a good basis 
for further research into problems such as methods for drawing large graphs and on-line (animated) 
graph drawing. 
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A Graph File Grammar 

The following is an abstract grammar of graph files. Terminals are shown in typewriter font and 
nonterminals in italics. Angle brackets ( and ) indicate grouping when needed. Double-line brackets 
[ and ] enclose optional items. Vertical bars | separate alternatives. 


graph 

-4 

[strict] ( digraph | graph )id { stmt-list > 

stmt-list 

-4 

[stmt [;] [stmt-list ] ] 

stmt 

-4 

attr-stmt | node-stmt | edge-stmt \ subgraph \ id = id 

attr-stmt 

-4 

(graph | node | edge)[ [ attr-list ] ] 

attr-list 

-4 

id=id [attr-list ] 

node-stmt 

-4 

node-id [ opt-attrs ] 

node-id 

-4 

id [: id ] 

opt-attrs 

-4 

[ attr-list ] 

edge-stmt 

-4 

{node-id \ subgraph) edgeRHS [opt-attrs ] 

edgeRHS 

-4 

edgeop {node-id \ subgraph) [ edgeRHS ] 

subgraph 

-4 

[subgraph id\ { stmt-list > | subgraph id 


An id is any alphanumeric string not beginning with a digit, but possibly including underscores; 
or a number; or any quoted string possibly containing escaped quotes. 

An edgeop is -> in directed graphs and — in undirected graphs. 

Semicolons aid readability but are not required except in the rare case that a named subgraph 
with no body immediate preceeds an anonymous subgraph, because under precedence rules this 
sequence is parsed as a subgraph with a heading and a body. 


B Plain Output File Format (-Tplain) 

The “plain” output format of dot lists node and edge coordinates that are usually needed by front 
end programs, in a line-oriented style. 

The first line is: 

graph scalefactor bounding-boxjx boundingJboxjy 
All coordinates are in default units (1/72 of an inch), unsealed. 

The next group of lines list the nodes in the format: 
node name x y xsize ysize label-text 

The name is a unique identifier. If it contains whitespace or punctuation, it is quoted. 

The next group of lines list edges: 

edge tailname headname n x\ y\ X 2 yi x n y n opt-text opt-x opt.y 

n is the number of coordinate pairs that follow as Bezier spline control points. If the edge is labeled, 
then the label text and coordinates are listed as the rightmost three items on the line. 

The last line is always: 

stop 
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C Layout Attributes 

Layout coordinates are in the default PostScript coordinate system. Node coordinates refer to their 
center points. The edge spline is a list of 3n 4- 1 points, plus optional p s and optional p e points. 

The 3n + 1 points are the Bezier control points. Points Po,Pi,P 2 ,P 3 ^ are the first bezier spline, 
P 3 i P 4 > Ps ) P6 are the second, etc. 

The p s point is present if there’s an arrow at po. In this case the arrow is from po to point p s , 
where p s is actually on the node’s boundary and po is further away. If there is no arrow, p 0 is on 
the node’s boundary. Similarly, p e is for an arrow on the other endpoint of the edge. 

Currently, edge points are listed top-to-bottom (or left-to-right) regardless of the orientation of 
the edge. This may change. 


D Layers 

dot has a feature for drawing a parts of a single diagram on a sequence of overlapping “layers. ” 
Typically the layers are overhead transparencies. To activate this feature, one must set the graph’s 
global layerseq to a list of identifiers. A node and edge can be assigned to a a layer or range of 
layers, all is a reserved name for all layers (and can be used at either end of a range, e.g design : all 
or all: code). For example: 

layerseq = "spec:design:code:debug:ship"; 
node90 [layer = "code"] ; 
node91 [layer = "design : debug"] ; 
node90 -> node91 [layer = "all"] ; 
node92 [layer = "all: code"]; 

In a layered graph, if a node or edge has no layer assignment, but incident edges or nodes do, 
then its layer specification is inferred from these. To change the default so that nodes and edges 
with no layer appear on all layers, insert near the beginning of the graph file: 

node [layer=all] ; 
edge [layer=all] ; 

There is presently no way to specify a set of layers that are not a continuous range. 

When PostScript output is selected, the color sequence for layers is set in the array layercolorseq. 
This array is indexed starting from 1, and every element must be a 3-array to be interepreted as a 
color coordinate. The adventurous may learn further from reading dot’s PostScript output. 
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E Color Names 


Whites 

Reds 

Yellows 

turquoise[l-4] 

antiquewhite[l-4] 

coral[l-4] 

darkgoldenrod[l-4] 


azure[l-4] 

crimson 

gold[l-4] 

Blues 

bisque[l-4] 

darks almon 

goldenrod[l-4] 

aliceblue 

blanched almond 

deeppink[l-4] 

greenyellow 

blue[l-4] 

cornsilk[l-4] 

firebrick[l-4] 

lightgoldenrod[l-4] 

blueviolet 

floralwhite 

hotpink[l-4] 

lightgoldenrodyellow 

cadetblue[l-4] 

gainsboro 

indianred[l-4] 

lightyellow[l-4] 

cornflowerblue 

ghostwhite 

lightpink[l-4] 

palegoldenrod 

darkslateblue 

honeydew[l-4] 

lightsalmon[l-4] 

yellow[l-4] 

deepskyblue[l-4] 

ivory[l-4] 

maroon[l-4] 

yellowgreen 

dodgerblue[l-4] 

lavender 

mediumvioletred 


indigo 

lavenderblush[l-4] 

orangered[l-4] 

Greens 

lightblue[l-4] 

lemonchiffon[l-4] 

palevioletred[l-4] 

chartreuse[l-4] 

lightskyblue[l-4] 

linen 

pink[l-4] 

darkgreen 

lightslateblue[l-4] 

mintcream 

red[l-4] 

darkolivegreen[l-4] 

mediumblue 

mistyrose[l-4] 

salmon[l-4] 

darkseagreen[l-4] 

mediumslateblue 

moccasin 

tomato[l-4] 

forestgreen 

midnightblue 

navajowhite[l-4] 

violetred[l-4] 

green[l-4] 

navy 

oldlace 


greenyellow 

navyblue 

papayawhip 

Browns 

lawngreen 

powderblue 

peachpuff[l-4] 

beige 

lightseagreen 

royalblue[l-4] 

seashell[l-4] 

brown[l-4] 

limegreen 

skyblue[l-4] 

snow[l-4] 

burlywood[l-4] 

mediumseagreen 

slateblue[l-4] 

thistle[l-4] 

chocolate[l-4] 

mediumspringgreen 

steelblue[l-4] 

wheat [1-4] 

darkkhaki 

mintcream 


white 

khaki[l-4] 

olivedrab[l-4] 

Magentas 

whitesmoke 

peru 

palegreen[l-4] 

blueviolet 


rosybrown[l-4] 

seagreen[l-4] 

darkorchid[l-4] 

Greys 

saddlebrown 

springgreen[l-4] 

darkviolet 

darkslategray[l-4] 

sandybrown 

yellowgreen 

magenta[l-4] 

dimgray 

sienna[l-4] 


mediumorchid[l-4] 

gray 

tan[l-4] 

Cyans 

mediumpurple[l-4] 

gray[0-100] 


aquamarine[l-4] 

mediumvioletred 

lightgray 

Oranges 

cyan[l-4] 

orchid[l-4] 

lightslategray 

darkorange[l-4] 

darkturquoise 

palevioletred[l-4] 

slategray[l-4] 

orange[l-4] 

lightcyan[l-4] 

plum [1-4] 


orangered[l-4] 

mediumaquamarine 

purple[l-4] 

Blacks 


mediumturquoise 

violet 

black 


paleturquoise[l-4] 

violetred[l-4] 



